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on three inductive reasoning tasks: analogies, series completions, and 
classifications. In Exneriment 1, 30 subjects completed an untimed pencil- 
and-paper test in which they were asked to solve 90 induction items, equally 
divided among the three kinds of induction items noted above. The subjects' 
task was to rank-order four response options in terms of their {»oodnes» of 
fit as completions for each particular item. Data sets for the three tasks 
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ware highly correUted, suggesting the possibility of a common model of re- 
sponse choice across tasks. Moreover, a single exponential model of response 
choice provided a good fit to each data set, The single parameter 'estimate 
for this model was roughly comparable across tasks. In Experiment 2, 36 sub- 
jects completed a timed tachistoscopic test in which they, too, were asked 
to solve 90 induction items, equally divided among the three kinds of induc- 
tion items noted above. The subjects' task was to choose the better of two 
response options as a completion for each particular item. Data sets for the 
three tasks were again highly intercorrelated, suggesting the possibility of a 
common model of real-time information processing across tasks. Moreover, a 
single linear model of response times provided a good fit to each data set. 
Three of four parameter estimates were roughly comparable across tasks. It 
was concluded that a common model of response choice and of information proces- 
sing can account for at least some of the previously observed relationships 
in performance across induction tasks. The implications of these findings for 

psychometric and information-processing accounts of induction and intelligence, 
are discussed. 
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Abstract 

1 

Two experinents sought to discover sources of conununalities in performance 
on three inductive reasoning tasks: analogies, series completions, and classi- 
fications. In Experiment 1, 30 subjects coBpleted an untiraed pencil -and-paper 
test in which they were asked to solve 90 induction items, equally divided 
among the three kinds of induction items noted above. The subjects' task was 
to rank-order four response options in terms of their goodness of fit as comple- 
tions for each particular item. Data sets for the three tasks were highly 
intercorrelated, suggesting the possibility of a common model of response choice 
across tasks. Moreover, a single exponential model of response choice provided 
a good fit to each data set. The single parameter estimate for this model was 
roughly comparable across tasks. In Experiment 2, 36 subjects completed a timed 
tachistoscopic test in which they, too, were asked to solve 90 induction items, 
equally divided among the three kinds of induction items noted above,' The sub- 
jects' task was to choose the better of two response x)ptions as a completion for 
each particular item. Data sets for the three tasks were again highly inter- 
correlated, suggesting the possibility of a common model of real-time information 
processing across tasks. Moreover, a single linear model of response times pro- 
vided a good fit to each data set. Three of four parameter Mtlmates for this model 
were roughly comparable across tasks. It, was concluded that a common model of 
response choice and of information processing can account for at least some of 
the previously observed relationships in performance across induction tasks. The 
iaplications of these findings for psychometric and information-processing accounts 
of induction and intnlligence are discusss^d. 

U 
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Unities in Inductive Retsonlng 

Inductive reasoning requlreLS an individual to reason froa part to whole, or 
froa particular to general (Webster's New Collegiate Di ctionary , 1976), Inductive 
reasoning probleas can be of various kinds. One of the most Interestinc; kinds is 
that which Greeno (1978) refers to as problems of inducing structure. Problerfis 
of this kind Include analogies (e.g., LAWYER is to CLIENT as DOCTOR is to (a) 
PATIENT, (b) MEDICINE), series completions (e.g.. Which word should come next 
in the following series? PENNY, NICKEL, DIME, (a) COIN, (b) QUARTER), and clfissl- 
fications (e.g.. Which of the two words at the right fits better with the three 
words at the left? CAT, MOUSE, LION, (a) SQUIRREL, (b) EAGLE). These problems 
are of particular Interest because the/ have played a key role in both the psyche 
aetric and information-processing literatures on reasoning and intelligence, as well 
as in the recent literature attempting to integrate the psychometric and Information 
processing approaches. 

In the psychometric literature, problems of inducing structure have been con- 
sidered Important because they provide particularly good measures of general intel- 
ligence, or g^. Factor analyses of multiple ability tests often yield a "general 
factor," or single source of individual differences, that permeates the entire range 
of tests (Spearman, 1927) . Even when this general factor does not appear from an 
immediate factoring of the tests (Thurstone, 1938), it often appears whv*n factors 
that do not include a general factor are themselves factored (Humphreys, 1962). 
When correlations are computed between individual tests and the general factor, 
probleas of inducing structure usually show some of the highest correlations. 
Moreover, scores on these problems are highly correlated among themselves (see 
Cattell, 1971; Horn. 1968). 
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These results suggest that some common source of individual differences underlies 
perfomance on these various kinds of problems. The importance of this connion 
source of individual differences to psychometric theory is shown by the fact 
that the concept of g has played a prominent part in several major psychometric 
theories of intelligence (e.g., Burt, 1940; Horn 5 Cattell, 1966; Humphreys, 
1962; Spearman, 1927; Vernon, 1971), and by the fact that the problems of in- 
ducing structure are such good measures of g that they are found on an over- 
whelming majority of psychometric intelligence tests. 

In the information-processing literature, problems of inducing structure 
have been considered important because the underlying processes involved in solving 
these problems seem to be so basic to human cognition, both in laboratory and 
real-world settings. These problems have served as the bases for a number of 
task analyses. Several computer programs, for example, have been devoted ex- 
clusively to the solution of analogies (Evans, 1968; Reitman, 1965) or series 
completions (Simon ^ Kotovsky, 1963), and other computer problems have dealt 
with analogies or series completions, among other kinds of problems (Williams, 
1972; Winston, 1974). Analogy problems have been studied experimentally in a 
number of Investigations with human subjects (e.g., Mulholland, Pellegrino, 
I Glaser, in press; Sternberg, 1977a, 1977b; Whitely § Barnes, 1979), as have 
series completions (e.g., Holzman, Glaser, § Pellegrino, 1976; Kotovsky § Simon, 
1973), We are unaware of any previously published information-processing analy- 
ses of the classification task, although Pellegrino and his colleagues are cur- 
rently studying this task (Pellegrino, Note 1), and the literature on concepts and thei 
attainment can be viewed as indirectly studying this sort of task (e.g., Bruner, 
Goodnow, S Austin, 1956; Rosch, 1978), 

Several information-processing psych'>logist.s have claimed that the high 

Q intercorrelations obtained between subjects' performances on various kinds of 
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problems of inducing structure ire attributable to communal i ties in infomatipn 
processing across the various problem types (e.g., Greeno, 1978; Pellegrino § 
Glaaer, 1979, in press; Sternberg, 1977b, 1979). The investigations reported 
here represent what we believe is a first attempt to demonstrate these commu- 
nalities in information processing experimentally. To the extent that the 
investigations are successful, they offer the promise of illuminating at least 
some sources of the mysterious g factor that has been obtained in numerous 
psychometric inquiries into the nature of human intelligence. 

Two experiments were conducted. The first experiment sought to demonstrate 
that a common model could account for response choices in the solution of analo- 
gies, series completions, and classifications. The second experiment sought 
to demonstrate that a common model could account for real-time information 
processing in the solution of these three kinds of problems. 

EXPERIMENT 1 

Our proposed model of response choice in inductive reasoning of the sort 
required by analogies, series completions, and classifications, is an extension 
of the Ruraelhart-Abrahamson (1973) model, of response choice in analogical reason- 
ing, Rumelhart and Abrahamson defined reasoning as the set of thought processes 
in information retrieval that operates upon the structure, as opposed to the 
content, of organized memory. If information retrieval depends upon specific 
content stored in memory, then retrieval is referred to as "remembering." If, 
however, information retrieval depends upon the form of one or more relationships 
uong words, then it is referred to as "reasoning." 

Pursuing this definition of reasoning, Rumelhart and Abrahamson claimed that 
probably the simplest possible reasoning task is the judgment of the similarity 
or dissimilarity between concepts. They assumed that the degree of similarity 
between concepts is not directly stored as such, but is instead derived from 
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prevleusly existing aemory structures. Judged similarity between concepts is' 
a iliple function of the "psychological distance" between these concepts in the 
•eaory structure. The nature of this function and of the memory structure upon 
which it operates is clarified by their assumptions (after Henley, 1969) than 
(a) the lemory structure may be represented as a multidimensional Euclidean space 
And that (b) Judged similarity is inversely related to distance in this space. 

On this view, analogical reasoning may be considered a kind of similarity' Judgment 
betvcen concepts, one in which not only the magnitude of the distance but also 
the direction is of importance. For example, we would ordinarily interpret the 
analogy problem, A : B C : X^, as stating that A is similar to B in exactly 
the same way that C is similar to X^. According to the assumptions outlined 
above, we wight reinterpret this analogy as saying that the directed or vector 
distance between A and IB is exactly the same as the vector distance between C 
and The analogy is imprecise to the extent to which the two vectoj- distances 
are not equal. 

Runelhart and Abrahamson formalized the assumptions of their model by stating 

that given an analogy problem of the form A : B : : C : (X, . X,,. . . . . X ) , It Is 

1 2 n 

assumed that 

I Al. Corresponding to each element of the analogy problem there is a point 
In an a-dimensional space.... 

A2. For any analogy problem of the form A : B : : C : ?, there exists a 
concept I such that A : B C : I and an ideal analogy point, denoted I such 
that I is located the same vector distance from C as B is from A. The coordi- 
nates of are given by the ordered sequence (c, ♦ b, - a,}, j * 1, m. 

M, The probability that any given alternative X, is chosen as the best 
! - 

janilogy solution from the set of alternatives X,, X is a monotonia de- 

creasing function of the absolute value of the distance betwetn the point X. 

10 -i 
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|tnd the point I^, denoted - IJ . (p. 4) 

The first tssunption simply states that the concepts corresponding to the 
•leaents of the analogy exist and are locatable within the n-dinensional space 
represei\ting the nenory structure. The second assumption states that an ideal 
solution point also exists within the nenory structure, and that this point 
tlso represents a concept; it is quite likely that no real-world concept will 
correspond to this ideal point, so that the ideal point may not have a named, 
concept in the English (or any other) language. The third assumption states that 
the selection of a correct answer option is goveined by the distance between 
the various answer options and the ideal point, such that less distant answer 
options are selected more often than are more distant answer options. 

These assumptions permit ordinal predictions about the goodness of the 
various answer options, but do not permit quantitative predictions. In order 
to make quantitative predictions of response choices, ROrae'lhart and Abrahamson 
made assumption 3 more specific, and added two more assumptions: 

3'. The probability that any given alternative is chosen from the set 
of alternatives Xj,....X is given by Pr(X |X ,...,X ) • p, » v(d.) /[? v(d.)], 
where d^ ■ |X^ - I^| denotes the absolute value of the distance between X. and I, 
and vC ) is a nonotonically decreasing function of its argument. 

4. v(X) - exp(-aX), where X and o are positive numbers. 

5. We assume that the subjects rank a set of alternatives by first choosing 
the Rank 1 element according to 3' and, then, of the remaining alternatives, 
deciding which is superior by application of 3' to the remaining set and assign- 
ing that Rank 2. This procedure is assumed to continue until all alternatives 
are ranked, (pp. 8-9) 

The more ,«pecific version of assumption 3 (labeled 3') is an Adaptation of 
Luce's (1959) choice rule to the choice situation in the analogy. Assumption 4 
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further specifies that the aonotone decrease in the likelihood of choosing t par- 
ticular answer option A? best follows an exponential decay function with increasing 
distance from the ideal point. The nodel of response choice therefot«V ripquires 
a single parameter, a, representing the slope of the function, Ruraelhart and 
Abrahaason actually had their subjects rank-order answer options. The investi- 
gators predicted the full set of rank orderings by assuming (in assumption 5) that 
once subjects had ranked one or more options, they would rank the remaining options 
in exactly the same way that they had ranked the previous options, except that they 
would ignore the previously ranked options in making their further rankings. 

Runelhart and Abrahamson (1973) carried out three ingenious experiments to 
test their model of analogical reasoning, using Henley's (1969) mammal-name space 
of three dimensions (size, ferocity, huraanness) as a basis for representing informatior 
about the mammals. The first experiment set out to show that subjects rank-order 
options in accordance with the assumptions outlined above. Subjects rank-ordered 
options in 30 analogy problems using mammal names as analogy terms. The second experi- 
ment set Cut to show that the response distribution should depend upon the ideal solu- 
tion point and upon the alternative set, but not upon the terms of the particular analc 
gy problem. Twelve analogy pairs were constructed that had the same ideal points withi 
a tolerance of .12 scaled units (roughly the distance between a lion and a tiger), and 
in which the ith closest alternative for one set was at about the same distance from 
ideal point as the ith closest alternative for the other set. There were no overlappiT 
analogy terms across paired items, however. The third experiment set out to show that 
if the ideal point of an analogy is given a name corresponding to an imaginary mammal, 
subjects will use the newly named mammal in the same way in solving analos;ies that 
they use the names of actual mammals. Subjects were taught the meanings of three 
imaginary mammals, and were then asked to perform some tasks that tested their under- 
standing of the properties of these imaginary mammals. The experiments were generally 
supportive of the Rumelhart-Abrahamson model, and the results led the authors to 
conclude that at least for those portions of semantic memory that are representable 

is 

as multidimensional ^mantic spaces, the proposed model of analogical rpasoning 
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provides a good account of response choices. 

We propose a nodest extension of the Rune 1 hart -Abrahanson model so that 
it can account for response choices in series completion and classification 
problems as well as in analogy problems. Figure 1 shows how the extended model 
accounts for response choices in each of the three types of problems. 

Insert Figure 1 about hare 

Consider an analogy problem of the form, A : B :: C : (Dj, Dj, Dj, D^), e.g., 
TIGER : CHIMPANZEE :: WOLF : (1. RACCOON, 2. CAMEL, 3. MONKEY, 4. LEOPARD), where 
the subject's task is to rank-order the answer options in terms of how well their 
relation to WOLF is parallel to that between CHIMPANZEE and TIGER. In an analogy 
problem such as this one, the subject must find an ideal point, I_, that is the 
same vector distance from WOLF as CHIMPANZEE is from TIGER. Having found this . 
point, the subject rank-orders answer options according to their overall Euclidean 
distance from the ideal point. The probability of selecting any one answer option 
as best is assumed to follow an exponential decay function, with probability de- 
creasing as distance from the ideal point increases. The same selection rule is 
applied in rank-ordering successive options, with previously selected options 
removed from consideration. 

Consider next a series completion problem of the form, A : B : (C^, C2, Cj, C^), 
e.g., SQUIRREL : CHIPMUNK : (1. RACCOON, 2. HORSE, 3. DOG, 4. CAMEL), where the 
subject's task is to rank-order the answer options in terms cf how well they 
complete the series carried from SQUIRREL to CHIPNfUNK. Here, the subject must find 
an ideal point, that is the same vector distance from CHIPNIUNK as CHIPMUNK is from 
SQUIRREL. Note that the difference between a series completion problem and an analo- 
gy is that whereas the terms of an analogy form a parallelogram (or its ra-dimensional 
analogue) in the multidimensional space, the terTc«; of a series completion form a 
line segment (or it» m-dimensional analogue) in the space. The same principle would 

*PPlyi regardless of the number of terms in the item stem. Having found the ideal 

15 
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point) the subject rank-orders answer options with respect to the ideal point in 
just' the sane way that he or she would in an analogy problem. 

Consider finally a classification problem of the form, A, B, C, (Dj, Dj, D3, D 
•.g., ZEBRA, GIRAFFE, GOAT, (1. DOG, 2. COW, 3. MOUSE, 4. DEER), where the subject' 
task is to rank-order the answer options in terms of how well they fit with the 
three terms in the item stem. In this type of problem, the subject must find an 
ideal point, I_, that represents the centroid in multidimensional space of ZEBR.\, 
GIRAFFE, and GOAT. Having found this point, the subject rank-orders the answer 
options according to their overall Euclidean distance ^from the ideal point, in 
just the same was as he or she would for analogies or series completions. Again, 
the same basic principle applies without regard to the number of terms in the item 
stem. The centroid of the points is theorized always to serve as the ideal point. 

In Experiment 1, subjects were presented ."^O analogies, 30 series completions, 
and 30 classifications (in an order counterbalanced across subjects). The subjects 
task was to rank-order the goodness of four alternative answer options in terms of 

f 

their appropriateness as completions to the problem stems. The task uas untimed. 

Method 

Subjects 

Thirty college-age adults from the New Haven area--l9 women and 11 men-- 
participated in the experiment. Subjects received either pay at the rate of \'2,50 
per hour, credit toward fulfillment of an introductory psychology course require- 
aent at Yale, or some combination of the two. 
Materials 

Problems of all three types--analogies , series completions, classif ications-- 
were composed of mammal names from the set multidimensionally scaled by Henley 
(1969). There were 50 problems of each type, and each drew upon the set of 30 
names in the following ways. 

Analogies were taken from Experiment 1 of Rumelhart and Abraharason (1973),^ 
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All 30 Analogy problems were of the fom, A : B C : (D^, D^, Dj, D^), for example 
TIGER : CHIMPANZEE :{ WOLF t (l. RACCOON, 2. CAMEL, 3. MONKEY, 4. LEOPARD),^ 
Runelhart and Abrahamson constructed their analog/ stems (A : B C) by sampling 
without replacement from the pool of 30 mammal names until all of the terms 
were exhausted, and then by replacing the entire pool. The first term sampled 
bec'ime the A term of the first problem* the second term became the B term of the 
first problem; the third terra became the C terra of the first problem; the fourth 
term became the A term of the second problem; etc. This selection procedure con- 
tinued until 30 unique analogy stems were formed. At this point, answer options 
were selected with the following constraints: (a) one option was within .5 
scaled units of distance from the ideal point; (b) a second option was between 
,5 and 1.0 scaled units from the ideal point; (c) a third option was between 1.0 
and 1,5 scaled units from the ideal point; (d) a fourth option was more than 1.5 
Scaled units from the ideal point; additionally, answer options were not permitted 

I 

to overlap with mammal names in the analogy stem. If any of these constraints 
could not be satisfied, a new unique stem was formed, and the constraints were 
again applied to the new analogy. This process was repeated until 30 acceptable 
analogies had been constructed. 

Series completion problems were of the form, A : B : (C^, C^, C^, C^), for 
example, SQUIRREL : CHIPMUNK : (1.. RACCOON, 2. HORSE, '3. WG, 4. CAMEL).' The 
series completion problems were generated in a manner that was isimilar to that 
used for the analogy problems. Ml possible combinations of two of the mammal 
terns were produced, and an ideal point for each of these pairs of terras was 
calculated. This procedure yielded a total of 870 possible series completion 
problems. Problems were selected from this set at random until 30 acceptable 
items were produced. The constraints for acceptability were that (a) the ideal 
point lie within the (arbitrary) boundaries of the multidimensional space pro- 

Vo 
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duced by Henley»s (1969) scaling and that (b) four suitable answer options could 
be selected according to the constraints used for the production of answer 
options in the analogy problems. 

Classification problems were of the form, A, B, C, CD , D., D , D ), for 

12 3 4 

exaonle:» ZEBRA, GIRAFFE, GOAT, (1. DOG, 2. COW, 3, MOUSE, 4. DEER)/ Classifica- 
tion problem stems were sampled from all possible triplets of mammal names with 
geometric perimeters of greater than 1.5 scaled units of distance, but less than 
2,0 scaled units. The perimeter of the "triangle" of terms in the multidimensional 
space was constrained in this way so that problems would be neither too easy nor 
too difficult: Very small perimeters (as would be obtained for TIGER, LION, 
LEOPARD) led to problems that were exceedingly easy, whereas very large perimeters 
(as would be obtained for GORILLA, SQUIRREL, CAMEL) led to probl«ms that were 
exceedingly difficult. The permissible range for acceptable triples was rather 
small, and forced modification of the constraints used in selecting answer options 
so that a sufficient number of acceptable items could be produced. The new con- 
straints were that (a) on( a.viwer option be within .4 scaled units of distance 
from the ideal point; (b) a second answer option be between .4 and .8 scaled units 
from the ideal point; (c) a third answer option be between .8 and 1.2 scaled units 
from the ideal point; and (d) a fourth answer option be at greater than 1.2 scaled 
units from the ideal point; answer options were not allowed to overlap in content 
with stem terms. Two hundred eighty-two clrssification problems satisfied all 
of the above constraints. These were sampled from randomly without replacement 
to produce 30 classification problesas for the experiment, 

A standardized test of general intelligence. Forms A and B of the Culture 
Fair Test of g. Level 5 (Cattell 5 Cattell, 1963), was also administered to all 
subjects. The test contained four types of inductive-reasoning problems-- figural 
series completions, figural classifications, figural matrix problems, and figural 
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topological reasoning problems. 
Prostdure 

Subjects were tested in groups of from one to nine members. After signing 
consent forms, subjects read instructions silently for the three tasks combined 
while the experimenter read them aloud. Instructions made clear the natures of 
the three different tasks, and no subject expressed any questions about how each 
of the three tasks operated, or about how they differed from one another. Subjects 
then solved the 90 test problems, which were presented In paper-and-pencil format. 
The subjects' task was to rank-order the answer options from best to worst; 
Subjects were allowed as much time as they needed to finish; this time period 
never exceeded IH hours. Subjects who finished before others in their group were 
given an irrelevant filler task. When all subjects had completed the test items, 
they received the standardized intelligence test. After the test was completed (in 
roughly 45 minutes), subjects were debriefed and compensated for their participation. 
Design 

The main dependent variable was proportion of subjects choosing each possible 
response as first, second, third, and fourth best. The independent variable used 
to predict these proportions was distance of each option from the ideal point. 
One parameter, a, was estimated for the predicted negative exponential function. 
Problems were blocked into sets of 30 analogies, 30 series completions, and 30 
classifications, presented in counterbalanced order across subjects such that 
five subjects received each of the six possible orders. Items within a block 
were presented in a different random order to each subject, and answer options 
within each item were also presented in a different random order to each sub- 
ject. The standardized intelligence test was scored for nuriber of items coti- 
pleted correctly. 
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Results 

Besic Data Sets 

• -OTawMPM WMwmaM ^^bsmwm 

The basic data sets for the present experiment, as well as for Experiraent 
1 of 'Runelhart and Abrahainson (1973), are shovm in Tabls 1. This table shows 

Insert Table 1 about here 

the proportions of subjects ranking each answer option as first, second, third, 
or fourth best, as a function of that option's distance from the ideal point. 
First, it is '.forth noting that the pattern of response choices in the 16 
cells for the present analogy data closely replicate those in; the 16 cells for 

•Ml 

the Rumelhart-Abraharason analogy data, r ■ .99-»-, RMSD (root-mean-square deviation) 
■ .02. Second, the patterns of response choices across the three tasks in the 
■present expariraen'. are highly similar: For analogies and series completions, 
r ■ ,99, RMSD ^t .03; . for analogies and classifications, r « .97, RMSD,« .05; for 
series completions and classifications, r « .98, RMSD « .04. These high levels 
of similarity in response choices are consistent with the notion that a single 
model of response choice is used in all three tasks. Further analysis is needed, 
however, to test our proposal for what this model is. 
Tests of Model of Response Choice 

The value of a was estimated as 2.52 for the analogies, 2.56 for the series 
completions, and 2.98 for the classifications. Although these values differ sig- 
nificantly from each other, F(2,48) ■ 3.73, p < .05, they are certainly in the 
same ballpark, and the most extreire value corresponds roughly to that obtained 
by Rumelhart and Abrahamson for their analogies, 2.9l. The somewhat discrepant 
value, that for classifications, was obtained in the task for which slightly dif- 
ferent constraints were set on the distances of answer options from the ideal 
point, which, conceivably, might have been partly responsible for the discrepancy. 

18 
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We believe that the three values are close enough to suggest that the decision 
rules used for rank-ordering options in each of the three tasks are extremely 
siailar. if not identical. 

The fits of the exponential model to the three data sets of the present 
experiraent plus that of Ruraelhart and Abrahamson (1973, Experiment 1) are shown 
In Figure 2, which compares predicted response choic as to observed response 
choices. For analogies, r ■ .97, RMSD « .OS; for series completions, r « .98, 



Insert Figure 2 about here 



RMSD • .04; for classifications, r « .99, RMSD « .03. Although the residual 
variance was small in each case, at least part of it was highly systematic. 
Residuals were correlated across task: The product -moment correlations were .83 
between predicted minus observed values for analogies and series completions, 
,80 between residuals for analogies and classifications, and .95 between residuals 
foi: series completions and classifications. A visual inspection of the residuals 
revealed to us at least some of the systematic trends. Prediction of first-choice 
data tended to be best, as expected, since a was estimated on the basis of the 
first-choice data. Also, the proportion of subjects predicted to 

choose the best option as second best was overestimated in all three tasks, while 
t'le proportion of subjects predicted to choose the best option as third or fourth 
best was underestimated in each case. Responses to the best option were thus 
■ore spread out than was predicted by the model perhaps because individual dif- 
ferences in perceptions of distance increase as distance increases. 
Individua l Differ ences 

Analyses of individual differences were disappointing: Values of a were 
not significantly correlated with each other across tasks, nor were they signifi- 
cantly correlated with scores on the Cattell Culture Fair Test of g. Several 
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Other indices of overall performance on the. three kinds of inductive reasoning 
tasks were also computed, but these did not correlate with each other or with 
the ability test. The low correlations presumably reflect the low reliability 
of the inductive-reasoning task scores for individual subjects, which for pro- 
portions of items answered correctly were .24 for analogies, .59 for series 
completions, and .30 for classifications. 

Discussion 

The results of this experiment suggest that one coraaunality in performance 
tcross analogies, series completions, and classifications is in the model of 
response choice subjects use in rank-ordering the goodness of alternative answer 
options. The same model seemed to be used in each of these three tasks, and 
even the value of the exponential response-choice parameter seemed to be about 
the sane in each experiment, and in close agreement with that obtained by 
Runelhart and Abrahamson (1973) in their study of response choice in analogical 

» 

reasoning. Although the proposed model of response' choice provided a good fit 
to the response-choice data, the residual variance was largely systenatic, sug- 
gesting that the proposed model did not capture all systematic features of the 
subjects' decision rule. The model also, of course, does not explain how subjects 
got to the poi.it where they could rank-order the answer options. The information- 
processing model described below seeks to provide such an explanation. 

EXPERIME.VT 2 

Our proposed model of real-time information processing in inductive reason- 
ing of the sort required by analogies, series completions, and classifications, 
is an extension of the Sternberg (1977a, 1977b) model of information rfocessing 
in reasoning by analogy. In this model, reasoning is viewed as involving (a) 
selection and execution of a set of components for solving reasoning problems, 
(b) selection and execution of a strategy for combining these components, (c) 
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selection tnd utilization of an internal representation for Information upon which 
the components tnd strategy act, (d) selection and oaintenance of a speed-accuracy 
tradeoff whereby components are executed at a rate that produces an acceptable 
level of accuracy in performance, and (e) aonitoring of one's decisions and solu- 
tion processes to assure that information processing is leading toward an accep- 
table solution to the problem at hand. 

Response time in reasoning is hypothesized to equal the sam of the amounts 
of time spent on the various information-processing components used in problem 
solution. Hence, a simple linear mod^l predicts response time to be the sum 
across the different components of the number of times each com^jonent operation 
is performed (as an independent variable) multiplied by the duration of that com- 
ponent operation (as an estimated parameter). Proportion of response errors is 
hypothesized to equal the (appropriately scaled) sum of the difficulties encoun- 
tered in executing each component operatic'n. A simple linear model predicts 
proportion of errors to be the sura across the different component operations of the 
number of times each component operation is performed (as an independent variable) 
multiplied by the difficulty of that component operation (as an estimated parameter). 
Thi^ additive combination rule is based upon the assumption that each subject has 
a Unit on processing capacity (or space; see Osherson, 1974). Each execution of 
an operation uses up capacity. Until the limit is exceeded, performance is flaw- 
less except for constant sources of error (such as motor confusion, carelessness, 
momentary distractions, etc.). Once tiie limit is exceeded, however, performance 
is at a chance level (Sternberg, 1977a). 

Consider as an example the analogy, TIGER : CHIMPANZEE :: WOLF : (1. CAMEL, 
VDMKEY), According to the theory, a subject encodes each terra 

of the analogy, retrieving from semantic memory and placing in working memory 
the locations in semantic space of the terms of the problem; next, the subject 
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Infers the relation between TIGER and CHIMPANZEE, recoj?niiing the vector distance 
between these first two terras of the analogy; then, the subject maps the higher- 
order relation between the first and second halves of the analogy, here recog- 
nising the vector distance from the terra heading the first half (TIGER) to the 
terra heading the second half (WOLF); next, the subject applies the relation in- 
ferred between the first two terms from the third analogy terra, here, WOLF, to 
fom an ideal point representing the ideal solution to the analogy; then, the 
subject compares answer options, seeking the ideal solution from among the answers 
presented;^ if none of the answer options corresponds to the ideal point, the subject 
must ju stify one of the answer options as preferable to the bther(s), in that it 
is closest to the ideal point (MONKEY is closer than CAMEL); finally, the subject 
respond s with the chosen answer, MONKEY. 

How does the information-processing model described above interface with the 
nodel of response-choice described earlier? Essentially, the exponential response- 
choice parameter of the response-choice nodel quantifies the decision rule used 
during justification of one response as superior to the others: It represents an 
end-product of the series of reasoning components. In a rank-ordering task, the sub- 
ject applies justification repeatedly, successively assigning rank i^ to the alterna- 
tive that is ith closest to the ideal point. Errors within and between subjects in 
the calculiition of distances between the ideal point and the various answer options, 
as well as differences in placements of points in the mammal-name space, lead to 
intra- and inter-individual differences in assignments of ranks in accordance 
with the exponential function. 

The sane basic raodel can be extended to series completion problems. Consider, 
for example, the series completion, SQUIRREL : CHIPMUNK : (1, RACCOON, 2. HORSE). 
The subject oust encode each term of the series completion. Next, he or she infers 
the relation of succession between SQUIRREL AND CHIPMUNK. Mapping is not 
necessary in this and other series problems, because there is no distinction 
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geneous domain: Geometrical ly» there is no realignment of vectors from one area of 
the space (A:B) to another (CtT), The subject miist, however, apply the relation 
inferred between SQUIRREL and CHIPMUNK from CHIPMUNK to an ideal point. Next, the 
subject CO pares the answer options, seeking the one corresponding to the ideal 
point. If neither option (or in the case of more than two options, none, of the 
options) corresponds to the ideal point, the subject justifies one option as closer 
or closest to the ideal point. In the present example, RACCOON is closer to the 
ideal point than is HORSE. Finally, the subject responds with the chosen answer. 
ks in the case of analogies, the rank-ordering task would require multiple justifica 
tions to determine which option is closest to the ideal point, of those options 
not yet ranked. 

The model can also be extended to classification problems. Consider, for 
example, the problem, ZEBRA, GIRAFFE, GOAT, (1, COW, 2. DOG). The subject 
must encode the terms of the problem. Next, the subject must infer what is 
common to ZEBRA, GIRAFFE, and GOAT, in essence seeking a prototype or centroid 
•that abstracts what is common to the three terms; as was the case in the series- 
completion problems, the subject need not map any higher-order relation, since all 
of the terms of the problem are from a single, homogeneous domain. In clas- 
sification problemr, application is also unnecessary, because the inferred centroid 
is the ideal point: The subject need not extrapolate in any way to seek some 
further point in the multidimensional semantic space. Next, the subject compares 
the answer options, seeking the ideal solution. If none is present, the subject 
justifies one option as closer to the ideal point than the other(s). Finally, 
the subject responds . As in the case of analogies and series completions, rank- 
ordering the options requires multiple executions of the justification component* 
Rankin2 in these problems and in the series completions proceeds according to the 
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decision rule described in Experinent 1. 

Whereas the sane single parameter of response choice applies in all three 
inductive reasoning tasks, the parameters of information processing in the three 
tasks are slightly different: The analogies task requires the full set of seven 
infomation-processing parameters; the series completion task requires a subset 
of six of the seven parameters in the analogies task; the classification task 
requires a subset of five of the six parameters in the series completion task. 
Thus, one would expect that for problems with terras of equal difficulty, analogies 
would be slightly more difficult than series completion problems, and series 
completion problems would be slightly more difficult than classification problems. 

In Experiment 2, subjects were presented with 30 analogies, 30 series comple- 
tions, and 30 classifications (in an order counterbalanced across subjects). The 
subjects' task was to select the better of two alternative answer options in terras 
of its appropriateness as a completion to the problem stem. The subjects were timed 
as they solved each item. 

Method 

Subjects 

Thirty-six college-age adults from the New Haven area-- 17 women and 19 men- 
participated in the experiment. Subjects received pay at the rate of $2.50 for 
their participation in the experiment. 
Materials 

The 30 analogies, 30 series completions, and 30 classifications had the sane 
items as the items in Experiment 1. They differed from the items in Experiment i, 
however, in having two rather than four answer options. The two options in each 
problem were randomly chosen with the constraint that each possible pairing of 
options (best with second best, best with third best, best with worst, second best 
with third best, second best with worst, third best with worst) 
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be equally represented. Since there were six possible 
kinds of pairings, five items of each of the three problem types contained one 
of the kinds of pairings. 

Three standardize^i ability tests were used in this experiment. Like the 
test in the first experiment, the items required inductive reasoning of the kind 

required for solution of the items in the main part of the experiment. The 

\ 

test* were figural classifications and figural analogies from Form I of the 
Cognitive Abilities Test, Level H (Thomdike § Hagen, 1971), 
^ and figural analogies from Form T of the Differential Aptitude Test (Bennett, 
Seashore, 5 Wesman, 1972). 
Procedure 

Subjects were tested individually. Stimulus items were presented tachisto- 
scopically, and response latencies were timed via an attached millisecond clock. 
The stimuli were presented via the method of precueing (Sternberg, 1977b, 197S), 
whereby each stimulus trial is divided into two parts. In the first part -of the 
trial, the subject received some amount of precueing; the subject was told to 
take as long as he or she needed to process the advance information fully, but 
no longer. In the second part of the trial, the subject received the full stimu- 
lus item; the subject was told to solve the item as quickly as possible, using 
whatever information may have been gleaned from the first part of the trial, 
without making an error. There were two conditions of precueing: In an uncued 
condition, the first part of the trial consisted merely of a lighted, blank 
field; in a cued condition, the first part of the trial consisted of either the 
first two terms of the item (analogies and series completions) or the first three 
terms of the item (classifications). The second part of the trial was always 
presented one second after the subject indicated by pressing a button that he 

or she had con^pleted processing of the first part of the trial, The purpose 

o Or 
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of the precueing was to facilitate separation of paranjeters that would other- 

wise have been confounded: The data of wrinary interest were those from the second 

part of the trial, in which the full item was presented. 

Each item was presented twice, once in the uncued condition and once in the 
cued condition. Items were presented in 12 blocks of 15 items each. Each block 
contained a single type of item (analogies, series completions, or classifications) 
in a single condition of precueing (uncued or cued). Item types alternated across 
successive blocks such that subjects always received one of the six possible permu- 
tations of analogies, series completions, and classifications in three adjacent 
blocks. The same permutation was used repeatedly (four times) for a given subject, 
but permutations were varied across subjects, Precueing conditions alternated 
across successive blocks such that a cued block always followed an uncued. block 
(or vice versa) until the full set of items had been exhausted after six blocks. 
At this point, each item had been presented in one or the other condition of 

\ 

I 

precueing. Then the items were re-presented according to the sane scheme, ex- 
cept that each item was presented in the precueing condition in which it had not 
yet appeared. Testing on the stimulus items consumed roughly 2'i hours. 

The ability tests were administered in pencil-and-paper format in fixed 
order (figural classifications, then figural analogies, then figural series com- 
pletions) upon completion of the tachistoscopic testing. Ability testing took 
about hour^ and was conducted in a separate session. 
Design 

The main dependent variable was response latency to solution in the second 
part of the trial. Error rate served as a subsidiary dependent variable. It 
was assumed that subjects used prior information presented in the cued condition 
to reduce as much as possible the information processing required in the second 
part of the trial. For example, when presented with the first two terms of an 
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Analogy in the first part of the trial i subjects were assuned to encode these 
terns and to infer the relation between them so that they would not have to 
perfora these operations in the second part of the trial. The independent 
variables used to predict response latencies and error rates in the second 
part of the trial for cued items thus took into account information processing 
In the first part of the trial. The independent variables (for both cued 
tnd uncued items) were (a) the number of terms to be encoded; (b) spatial dis- 
tance between A and jS (for analogies and series completions) or the maximum of 
the three distances between A and JB, A and C, and £ and C (for classifications), 
used to estimate inference time and difficulty; (c) spatial distance between 
A and C (for anaJ >gies), used to estimate mapping time and difficulty; (d) spa- 
tial distance between £ and (for analogies) or between B and I (for series 
completions), used to estimate application time and difficulty; (e) spatial 
distance between D and £ (for analogies and classifications) or between C 
and C, (for series completions), used to estimate comparison time and difficulty; 
(f) spatial distance between I_ and ^^y^^ analogies and classifications) 

or between 1^ and C^^y^^(for series completions) , used to estimate justification time 
and difficulty, Motor response tine and difficulty were estimated as regression constan 

Subjects were crossed with precueing conditions and with items such that 
each subject received each item in each condition of precueing. Items within 
a block were presented In a different random order to each subject. Ability 
tests were scored for number of items completed correctly. 

Results 

Basic Statistics 

Table 2 presents mean response latencies for all items (correctly answered 
and incorrectly answered combined) and for correctly answered items only. 



Insert Table 2 about here 
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Recall that according to the proposed infomation-processin? model, errors can 
result when an overflow occurs in the processing capacity or space allocated 
to t given item. When an overflow occurs, some items are answered correctly by 
chance; others are answered incorrectly. An incorrect answer guarantees that 
in overflow has occurred, although a correct answer does not guarantee that one 
has not occurred, since the answer nay have been correct by chance. But the sane 
Information-processing model applies in any of these events. Hence, modeling was 
performed upon all data points. In fact, modeUng correct latencies only would 
have had little differential effect, since, as can be seen in the table, the mean 
values for the two data sets were very close to each other, and since the correla- 
tioa:across item types between the two data sets were very high (.95 for analogies, 
,93 for series completions, and .97 for classifications).^ 

A two-way analysis of variance on all solution latencies revealed a signifi- 
cant effect of task, F(2, 70) . 30.12, £ < .01, and of precueiig. F(l,35) « 145.51 
£ < .01. The interaction was not significant, FC2,70) » .74 , £ > .10. a two- 
way analysis of variance oK fcrror rates also revealed a significant effect of task, 
F(2,70) ■ 6,64, £ < .01, but a nonsignificant effect of precueing condition, F(1,3S) 
■ 1.52, £ > .10. The interaction was nonsignificant, FC2,70) - .29, £ > .10. 

It is of interest to note that analogies were solved most slowly, series 
completions next most slowly, and classifications most rapidly. This is the rank 
or4er of processing time predicted by the model, according to which analogies re- 
quirexone more component than do series completions, which in turn require one 
more component than do classifications, The error rates did not show this pattern: 
AlthougT^ classifications had a lower mean error rate than did analogies, series 
completions had the highest error rate. We are uncertain as to how to account 
for this fi-nding. 
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Task In t<trcorrelations and Factor Structure 

Intercorrelations across subjects were computed b,etween mean response la- 
tencies for each pair of data sets: The correlations were ,85 between analogies 
and series completions, .86 between analogies and classlfi^lons, and .88 be- 
twecn series completions and classifications, A principal -components factor 
analysis of the tliree sets of latencies revealed a strong general factor in 
the first, unrotated principal component, accounting for 91% of the variance in 
the Individual-differences data. Had the tests shown no overlap in individual 
dlfferencei- variavion (zero intercorrelations) , this factor would have accounted 
for only ll\ of thi: *ndividual-differences variation. The data are thus consis- 
tent with the notion that a single real-time information-processing model n^ght 
apply across tasks. \y 

A comparable set of analyses was performed on the ability-test scores: 
Here, the correlations were .72 between analogies and series completions, .45 
between analogies and classifications, and ,65 between series completions and 
classifications. A principal-components factor analysis of the three sets of 
test scores (numbers correct) revealed an unrotated, general first factor ac- 
counting for 74% of the variance in the individual-differences data. Again, 
such a factor would have accounted for only 33% of the individual-differences variation 
If the intertask correlation had been 0. These results, too, are con- 

fistent with the notion of common processes across tasks. Indeed, high correla- 
tions and the resulting strong general factor resulting from sets of ability 
tests like these were the first psychometric clue we had, historically, that 
common processes were involved across inductive reasoning tasks found on intel- 
ligence tests. 

Finally, intercorrelations were computed between task scores across the two 
forms of task presentation (tachistoscopic, leading to response latencies, and 

pencil-and-paper, leading to nuT.bers correct). Correlations across task format 

O v. 
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were lowt. than those within format, as would be expected if there were at least 
son^ Bediuja-specific variance that were not shared across task formats. Such 
■ediun-specific variance aight result from differences across task formats in 
speed-Accuracy tradeoffs, in attentional allocations for items presented singly 
(as in a tachistoscopic task) and for items presented as a group (as in a pencil- 
tnd-paper task), in kinds of strategy or other planning required, or in what is 
■easured by latency and accuracy scores. Most probably, some combination of 
these and other factors was involved. The correlations ranged from -.21 to -.41, 
with a median for the nine intertask correlations of -.35 C£ < .05). Correla- 
tions of tasks with their analogues across formats (e.g., tachistoscopic analo- 
gies with pencil-and-paper analogies) were only trivially higher than correlations 
of nonanalogous tasks across formats (e.g., tachistoscopic analogies-'if^th pencil-'' 
and-paper series completions): The median correlation for analogous tasks was 
-.35 (£ < .05), where?.s the median correlation for nonanalogous tasks was -.30 
(£ < .05). A factor analysis of the six tasks (three tachistoscopic and three 
pencil-and-paper) yielded a first, unrotated principal component accounting for 
57% of the variance in the data. If tests were unccrrelated, a value of 17% would 
have been obtained. The response latencies all loaded in the .80s on this factor, 
whereas the number -correct measures all loaded in the .60s on the factor. The 
higher loadings of the response-latencies would be expected on the basis of their 
higher intercorrelations with each other. As expected, the second unrotated prin- 
cipal component, accounting for 26% of the variance in the data, was a bipolar 
factor distinguishing pencil-and-paper tasks from response-latency ones. The 
general factor unifying the various kinds of tasks was thus about twice as strong 
ts the medium-specific factor dif ferentiatij g the two task formats. Subsequent 
factors were of little interest. 
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Tests of Hodel of Information Processing 

Resppnse latencies . The proposed model of information processing; was fit 
to the res^iDnse latencies for uncued and cued conditions combined (60 data points) 
in each of the three tasks. Four parameters--encoding, comparison, justification, 
and response--could be estimated reliably in each of the three tasks, and hence 
final models were based only upon these four parameters. Other parameters could 
be estimated reliably in some tasks but not others, but since our interest was 
in unities in information processing, these parameters were deleted from the final 
common model. It is impossible to say whether the failure to estimate these 
other parameters reliably was due to insufficient stability of the data or to 
failure of the~full. model to account for information processing in one or more 
tasks. Further analyses will be based only upon the common parameters. 

Table 3 shows parameter estimates for each parameter in each task. If the 

Insert Table 3 about here 

tasks truly involve the same components, then the parameter estimates should be 
equal within a margin of error of estimation across tasks, ^ A one-way analysis 
of variance was conducted across tasks upon each of the four parameter estimates 
of interest. For encoding, F(2,70) =2.81, .05 < £ < .10; for comparison, F(2,70) 

■ ,46, 2,> .10; 'or justification, F(2,70) » 9.88, p < .001; for response, F^(2,70) 

■ 2.48, .05 < £ < .10. Only one of the parameterS"-justi{'ication--showed a clearly 
significant difference in value across the three tasks; two others showed marginally 
significant differences, and one showed no difference at all. These results are 
Interpreted as at least modestly supportive of process equivalence or near-equivalenc 
across tasks, witK^he exception of the result for justification. 

The fits of the^^ib^el to the three sets of data ware assessed by correlating 
predicted with observed values and by calculating the root-raean-square deviation 
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(RMSD) of observed from predicted values. For analogies, r « .88, RMSD • l.lO 
seconds; for series completions, r « .82, RMSD - .92 seconds; for classifications, 
r ■ .78, RMSD ■ 1.09 seconds. The maximum possible values of the correlations 
according to classical \test theory are the square roots of the reliabilities of 
the data sets, which were .96 for analogies, .92 for series completioni;, and 
.93 for classifications, Thus, most but not all of the systematic variance in the 
data was accounted for by the model. Residuals of observed from predicted values 
were correlated for random halves of the subjects, and corrected by the Spearman- 
Brown formula. All correlations were statistically significant (.61 for analogies, 
.64 for series completions, .75 for classifications^:^, indicating that the residual 
variance was indeed quite reliable. At least some of this unaccounted for variance 
was probably attributable to parameters that were not statistically reliable for all 
three tasks, and hence were not included in the final model fitting for any of 
the tasks. Further unexplained variance probably resulted from attributes of mammals 
that are not captured by the three-dimensional spatial representation into which 
the mammals were placed. 

Error rates . Similar modeling was done for error rates, although parameter 
estimates for individuals were extremely unstable, and could not be compared 
across tasks. Fits were assessed by the correlation between predicted and observed 
values and by root-raean-square deviation. For analogies, r « .43, RMSD » ,16; 
for series completions, r ■ .61, RMSD - .15; for classifications, r « .39, RMSD - .16. 
The respective square roots of the reliability coefficients were .92, .92, and .93. 
C orrelations of Lat ency Parameters across Tasks 

Correlations were computed across pairs of latency parameters for individual 
tasks. These correlations were generally positive but statistically nonsignificant, 
perhaps in part because each individual parameter estimate was computed on the basis 
of only a single observation for each of 60 data points (30 uncued and 30 precued). 
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Ditcussion 

The results of this experiment suggest that one coranunallty in performance 
across analogies, series completions, and classifications is in the aodel of 
information processing subjects use in solving the induction problems. The 
same model seemed to be used in each of the three tasks, and the values of the 
real-tine information processing parameters were generally comparable across 
tasks, with the notable exception of justification. Because previous tests of 
the proposed model on analogy problems were conducted upon problems with very 
different types of content, it was not possible to compare parameter estimates 
obtained in this experiment to those obtained in previous work (e.g., Sternberg, 
1977b). Although the proposed model of information processing provided a good 
fit to the latency data and a fairly good fit to the less reliable error data, 
the residual variance was systematic at lekst in part, suggesting that the 

» 

proposed model did not capture all systematic features of the subjects' inforraa- 
tion processing. The model does not, of course, explain how subjects actually 
decide upon their response choices: This function is served by the previously, 
described model of response choice, which fills in the decisions that take place 
during execution of the justification component, 

GENERAL DISCUSSION 

The two experiments reported in this article, taken together, suggest that 
the high intercorrelations found in the past across subjects' performance on three 
inductive reasoning tasks commonly found in intelligence-testing batteries were 
probably due at least in part to comraunalities in models of response choice and 
information processing used in the three tasks. Results of the first experiment 
indicated that the Ruraelh.trt-Abrahamson (1973) mode? of response choice in analogi 
cal reasoning could be extended to series completions and classifications as well; 
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results of the second experiment Indicated that the Sternberg (1977a, 1977b) model 
of information processing in analogical reasoning could be similarly extended. 

The unities identified in this experiment have been demonstrated only for 
reasoning problems using a single type of content--raamraal names. In order to 
build a strong case for a unified account of inductive reasoning in the three 
tasks studied here, one would want to demonstrate communal ities across cont*jnts 
as well as within a single content. One of us is currently engaged in analyzing 
data from such research (Sternberg, Note 2), where the three tasks considered in 
this experiment are crossed with three different types of content— schematic- 
picture, verbal, and geometric. 

The ijnpleoentations of the models described in this research assumed a 
■ultldiraensional representation for information in the data base upon which infor- 
mation processing took place. This assumption limits the content domains to which 
the theories as implemented here can be expeditiously applied, since only semantic 

Q 

fields seem to yield clean, interpretable, and replicable dimensions. Previous 
research has shown that theory testing can be carried quite far through the use of 
semantic fields (e.g., Rips, Shoben, § Smith, 1973; Rumelhart § Abrahamson, 1973; 
Shepard, 1964; Shepard, ICilpatric, § Cunningham, 1975; Smith, Shoben, § Rips, 1974; 
Sternberg, Tourangeau, 5 Nigro, 1979; Tourangeau § Sternberg, in press). Never- 
theless, the constraint of using a semantic field must be seen as limiting the 
generality of the results. It must be emphasized that this constraint is a practi- 
cal one rather than a theoretical one, in the sense that a spatial representation 
might apply to items not falling into semantic fields, but merely be difficult to 
reproduce through the experimental and scaling techniques presently available to us. 
We agree with Hutchinson and Lockhead (1977) that a spatial representation provides 
a rather general form of representation for a variety of purposes, and with Fiollan 
Q C197S) that spatial and network models are mathematical Iv interchangeable. Rips, 
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Smith, ind Shoben (1975) have noted that the choice of representation serves a 
largely heuristic f met ion, and in the present research, the spatral representa- 
tion seeas to have served this function reasonably well. Although we believe 
that subjects use a spatial representation in solving these problems, we also 
believe that other representations are available to subjects as they solve these 
(and other) problems, and that subjects make use of these alternative represen- 
tations as needed. To illustrate, modeling of error rates in the first experi- 
ment reported here (and in a previously reported experiment, Sternberg, 1977b) 
reveals that an overlapping clustering model (Shepard § Arabie, 1979) fits the 
error data quite well for analogies and classifications. In classifying three 
animals such as a LION, TIGER, and LEOPARD as similar, for exaniple, it seems 
quite likely that subjects would cluster them as "ferocious jungle beasts" as 
well as identifying them according to their size, ferocity, and huraanness in 
a spatial representation. The cluster representation does not work well for 
series completions, where^ logically as well as psychologically, it would seem 
inappropriate. As mentioned earlier, some of the systematic unexplained variance 
in the modeling of the data in these experiments was probably due to alternative 
encodings of the mammal names that are not captured in the first three diaensions 
of a spatial model of representation. 

Another simplification in the present research is the assumption that an 
additive, serial model cf information processing describes the sequencing of infor- 
mation processing in the execution of strategy for solving induction items. In 
fact, it seems unlikely that subjects process information strictly in this manner. Gru 
din (in press) has prtsented evidence suggesting that in analogy solution, subjects 
sometimes infer the relation between A and C and map the relation between A and B, 
rather tnan the other way aroundl l/hitely and Barnes (1979) have presented evi- 
dence suggesting that there are at least some individual differences in the order 
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In which subjects process information in analogy solution. The work of Simon and 
Kotovsky (1963) also suggests foms that individual differences might take in 

• * 

the solution of series completions. ThjB approximation proposed in this article 
Seems like a reasonable start toward understanding reasoning in induction prob- 
lems, although there is certainly a long way to go in this quest. 

The present research goes far enough to suggest that there are important 
communal ities in the rules for response choice and the strategies for informa- 
tion processing used in three induction tasks, but it does not go far enough 
to Suggest just what psychological mechanisms underlie these communal ities. Pro- 
cesses such as "encoding" and "inference" need to be unpacked in order to deter- 
mine not just what is done, but how it is done. 

We have concentrated upon communal ities across induction tasks in the per- 
formance component of information processing, but we believe that another major 
source of communal ities across these and other tasks is to be found at the 
"oetacomponential" level of information processing (Sternberg, 1979, Note 3), 
where plans and decisions are made regarding what performance components will be 
used in information processing. The problem of isolating the metacomponents 
of information processing from composite task performance is currently being 
pursued by one of us (R.J.S.) in collaboration with Bill Salter. 

One of the most vvidely replicated findings in the literature on human intelli 
gence is that of a general factor of intelligence, whereby people who tend to 
perform well (or poorly) on one set of tasks also tend to perform well (or poorly) 
on other sets of tasks. Although the finding of a general factor can scarcely be 
disputed at this point in time, the psychological explanation for this finding has 
been a source of considerable debate (Hunt,. 1978; Spearman, 1927; Thomson, 1939; 
Thorndike, Bregman, Cobb, § Woodyard, 1928; Sternberg, 1979; Hunt, Note 4; Jensen, 
Note S; Sternberg, Note 3). The present article suggests some possible sources 
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of fenerality in tn important subset of intelligent behavior, that of inducing 
structure. 

In one respact, the major objective of the present research is^^onsistent 
with that of traditional factor-analytic research: Both kinds of research seek to 
understand sources of communalities in performance on different complex information- 
processing tasks. In another respect, however, th^, objective is quite different: 
The present research seeks these sources of communal\ty in dynamic information- 
processing constructs rather than in static factorial ones. In this respect, 
the research is similar in its major objective to the research of Carroll (1976). 
Hunt. Lunneborg. and Lewis (197S). and Sternberg C1977b). Factors are viewed as 
Interpretable in information processing terms, and hence as reducible to more 
elementary sources of individual differences. But just as it was necessary to 
understand factors in terms of the more basic information-processing components 
that constitute them, so is it now necessary to understand the psychological 
■echanisras that are used to effectuate these performance components. Research 
on metacomponents may prove to be a start in this direction, although it is too 
early to tell. To a large extent, this research, like factor-analytic research, 
will be successful to the degree that it stimulates the level of theorizing that 
will eventually subsume it as a special case. 
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Footnotes 

« 

This research was supported by Contract N0001478C002S from the Office of 

Naval Research to Robert J. Sternberg. We are grateful to nembers of Robert 

Sternberg's research group at Yale for valuable comments on an earlier version 

of the manuscript. Portions of this paper were presented at the annual meeting 

of the Society for Mathematical PsychoK^gy. Providence. August. 19-9. Requests 

for reprints should be addressed to Robert J. Sternberg, Department of Psychology, 

Yale University, Box llA Yale Station. New Haven, Connecticut 06S20. 
1 

We are grateful to Adele Abrahamson for supplying the analogies used in 
Experiment 1 of Rumelhart and Abrahamson (1973). The analogies were formed from 
Henley's (1969) scaling of 30 mammal names in a three-dimensional space. This 
■ultidimensional scaling resulted in three interpretable dimensions-size, ferocity, 
and humanness-.and Euclidean distances within the space were computed on the basis 
of distances between coordinates on these three dimensions. 

^The "correct" rank order, based upon distance from the ideal point, is 
1. MONKEY, 2. RACCOON, 3. LEOPARD, 4. CAMEL. 

''The "correct" rank order is 1. RACCOON, 2. DOG, 3. HORSE, 4. CAMEL. 
4 

The "correct" rank order is 1. DEER, 2. COW, 3. DOG, 4. MOUSE. 

The comparison component did not appear in the theory of analogical reasoning 
as originally presented (Sternberg, 1977a, 1977b). The model separating comparison 
from application was tested in one experiment (Sternbtrg, 1977b, Chapter 7), but 
was found not to perform as well as the model in which this separation did not occur. 
Hence, the model with the additional parameter was rejected. Whitely and Barnes 
(1979) have recently suggested that the separation should have been maintained. 
The present data are consistent with their data in suggesting the utility of the 
separation, and hence it is now introduced into the model, as suggested by l^itely 
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and Barnes, who refer to the component as "confirmation." 

^We initially performed modeling on both sets of data, but" because the only 
differences that were obtained were minor and seemingly due to the lessened relia- 
bility of the "corrects only" data set (from which error latencies had been removed) 

we discontinued this duplication of, effort. 
7 

In order for the expectation of the parameter estimates to be the same in 

all three tasks, it is also necessary that the same content and I'epresentatipn 

of infor.Bation apply across tasks. The content obviously was the same. There is 

some evidence of possible minor differences in representations (as discussed in 

the General Discussion of the paper) . 
3 

By "semantic field," we mean a set of terras that all fall into a single, 
clearly definable, semantic category, such as mammal names, bird names, presidents 
of the United States, brand names of automobiles, etc. 
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Subjectf' Rankings as a Function of Alternative Dir'.ance and Task 
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Note: Tabled values represent proportions of subjects assigning each rank 
to each answer option. Each proportion is based upon 900 observations 

(30 subjects X 30 items). 
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Table 2 

Mean Solution Latencies and Error Rates for Each Condition of Precueing 



Response Latencies 



All I teas 



Items Correctly 
Answered 





Uncued 


1 

Cued 


Uncued 


Cued 


Analogies 


8.51 


6.06 


8.28 


5.95 


Series Completions 


7.08 
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6.74 


4.99 


Classifications 


6.65 


4.29 


6.45 
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Error Rates 
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Uncued 


Cued 
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Analogies 


.23 
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.22 
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.26 


.24 
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Note: Response latencies are expressed in seconds. 
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Table 3 

Ptraaeter Estimates for Each Information-Processing Component in Each Task 



Parameter 



Task 



Parameter Estimate 



Encoding 



Analogies 

Series Completions 

Classifications 



1.22 
1.00 
.79 



Comparison 



Analogies 

Series Completions 
Classifications 



.13 
.14 
.J4 



Justification 



Analogies 

Series Completions 
Classifications 



.36 
;18 
.24 



Analogies 1.36 
Response** Series Completions 3.36 

Classifications 2.93 



Note: Parameter estimates, expressed seconds, are unstandardized linear 

regression coefficients. Comparison was estimated as a "time savings" 
for greater distance, but is expressed here in unsigned form. All 
coefficients are statistically significant at the 5% level or better. 

*The estimation of this parameter includes the response component latency plus 
any other latency constant across all item types. 
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Figure Captions 

Figure 1. Schematic diagrams showing rules for arriving'at ideal point, I, in 
each of three induction tasks. In analogies, I^ is located as the fourth vertex 
in a parallelogram having A, B, and C as three given vertices. In series com- 
pletions, I is located as the completion of a line segment that is at the same 
vector distance from B that B is from A. In classifications, l_ is the centroid 
of the triangle with A, B, and C as Vertices. The rules can be extended to n 
dimensions by assuming n-dimensional analogues to the two-dimensional figures 
depicted. In each type of problem, four answer options are presented at sue- 
cessively greater Euclidean distances from the ideal point. 

Figure 2. Predicted versus observed proportions of subjects ranking each 
of the four answer options as first, second, third, and fourth choice. Columns 
1, 2, 3, and 4 correspond to the first-, second-, third-, and fourth-choice 
data, respectively. Panel A is for Rumel hart "and Abraharason's (1973) 'analogy 
data; panel B is for the analogy data in the current experiment; panel C is for 
the series completion data in the present ^experiment ; panel D is for the 
classification data in the present experiment. The abscissa of each graph is 
the rank distance of each answer option from the ideal point; the ordinate is 
the proportion of subjects choosing each option. Predicted data are represented 
by Solid lines, and observed data are represented by broken lines. 
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